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We explore the physical origin and robustness of constraints on the energy density in relativistic 
species prior to and during recombination, often expressed as constraints on an effective num- 
ber of neutrino species, AT e g. Constraints from current data combination of Wilkinson Microwave 
Anisotropy Probe ( WMAP) and South Pole Telescope (SPT) are almost entirely due to the impact 
of the neutrinos on the expansion rate, and how those changes to the expansion rate alter the ratio 
of the photon diffusion scale to the sound horizon scale at recombination. We demonstrate that 
very little of the constraining power comes from the early Integrated Sachs- Wolfe (ISW) effect, and 
also provide a first determination of the amplitude of the early ISW effect. Varying the fraction of 
baryonic mass in Helium, Yp, also changes the ratio of damping to sound-horizon scales. We discuss 
the physical effects that prevent the resulting near-degeneracy between N e s and Yp from being a 
complete one. Examining light element abundance measurements, we see no significant evidence for 
evolution of N c s and the baryon-to-photon ratio from the epoch of big bang nucleosynthesis to de- 
coupling. Finally, we consider measurements of the distance-redshift relation at low to intermediate 
redshifts and their implications for the value of N e ff. 



I. INTRODUCTION 

High-resolution observations of the cosmic microwave 
background (CMB) temperature anisotropy are provid- 
ing a precise measurement of the damping tail of CMB 
power spectrum, shedding light on the physical condi- 
tions during recombination and back into the radiation 
dominated era. The measurements have revealed some- 
what less fluctuation power at small angular scales than 
expected in the standard cosmological model pUI]. Many 
papers [3TfTU] have considered the possibility that this 
deficit of power is due to extra (dark) relativistic species 
[53] . such as nearly massless sterile neutrinos. Con- 
straints are usually expressed in terms of an effective 
number of neutrinos, N g [55] , 

Note that these constraints apply to any weakly in- 
teracting or non-interacting species that is relativistic at 
recombination. Hypothesized additional species include 
sterile neutrinos, sub-eV mass axions as in, e.g., Hannes- 
tad et al. [TT] and those arising in many other extensions 
of the standard model such as in Fischler and Garcia [12] . 

Interest in the number of light degrees of freedom is 
further stimulated by i) recent inferences of the primor- 
dial Helium abundance which are larger, and with larger 
uncertainties than previous analyses [13 15 , ii) evidence 
for additional (sterile) neutrino species from laboratory- 
produced neutrinos [16j and reactor-produced neutrinos 
[17] and iii) a slight tension between determinations of 
distance vs. redshift at very low redshifts (essentially 
measurements of Ho [15] ) and those at low to interme- 
diate redshifts that use the baryon acoustic oscillation 
(BAO) feature in the galaxy power spectrum as a CMB- 
calibrated standard ruler [23 [20] • 

Given this interest, and impending improvements to 
the damping tail measurements from the SPT [4J (Kll 
hereafter) and the Planck satellite, a pedagogical expo- 



sition of the impact of N e g on the CMB is quite timely. 
The relevant physics is beautifully simple and deserves 
to be as well (and as broadly) understood as the con- 
straints on N e s from big bang nucleosynthesis (BBN) 
considerations. Focusing on the physics behind the con- 
straints also allows one to understand their more general 
implications. The only important assumptions about the 
relativistic degrees of freedom that go into the CMB pre- 
dictions is that their interactions are negligible and they 
are massless. 

Despite the analyses provided by Hu and White [21] 
and Bashinsky and Seljak [22] we find confusion in the 
literature about why increasing N e s reduces small-scale 
anisotropy. The sensitivity to N e g from high-resolution 
CMB observations is due to the impact of the mean rel- 
ativistic energy density on the Universe's expansion rate 
prior to (and during) the epoch of photon-baryon decou- 
pling. As we demonstrate below, neutrino perturbations 
do not play a significant role [56] . nor do anisotropics 
induced after decoupling. To demonstrate the insignifi- 
cance of the early Integrated Sachs- Wolfe effect, we ar- 
tificially introduce a scaling of its physical value by a 
parameter A e igw and examine the constraints on iV e fj 
that follow from marginalizing over A c isw- 

The constraints on N e g are model dependent; there are 
other ways to extend the standard cosmological model 
to suppress small-scale power in the cosmic microwave 
background. We discuss, in particular, departures from 
a power law for the primordial perturbation spectrum, 
and allowing the fraction of baryonic mass in Helium, 
Yp, to be a free parameter. 

We also consider the sensitivity of inferences of N e g 
from CMB data to assumptions about BBN and measure- 
ments of light-element abundances. Allowing Yp to vary 
freely introduces a degeneracy that completely changes 
the mechanism by which N e ff is constrained by CMB 
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data. We comment on the origin of constraints in the 
iVeff - Yp plane. 

In Section II we review the analytic explanation for the 
origin of constraints on N e s from CMB observations and 
demonstrate its quantitative effectiveness in understand- 
ing constraints from current data. We also examine the 
model dependence of constraints on AT c ff. In Section III 
we consider the impact of the early ISW effect on N e s 
constraints and we discuss our results and conclude in 
Section IV. 



II. HOW CMB OBSERVATIONS CONSTRAIN 
COMPONENT DENSITIES 

Let us review the sensitivity of the CMB power spec- 
trum to the densities today of baryons, pb, cold dark 
matter plus baryons, p m , and the dark energy, pa, all 
within the context of the ACDM model [ST]. For more 
details than we give here, see (23] |24] ■ In the course of 
our review, we will identify directions in parameter space 
that cause large changes in probability density. When we 
do study variations in N e g we will do so along orthogonal 
directions. 



A. Sensitivity to the Non-neutrino Components 

The dependence of the CMB on pb arises from 
the dependence of the equation-of-state of the pre- 
recombination plasma on the fraction of its energy den- 
sity that comes from baryons. Increasing the number 
of baryons per photon decreases the plasma's pressure 
to density ratio (P/p). The resulting shift in the equi- 
librium point between gravitational and pressure forces 
alters the ratio of even peak heights to odd peak heights. 
We know the energy density of photons very well from 
measuring the CMB photon spectrum [35], and can thus 
infer pb from the ratio of even to odd peak heights. 

The dependence on p m arises from the sensitivity of the 
evolution of a Fourier mode amplitude to the fraction of 
the energy density contributed by non-relativistic matter 
when the mode's wavelength is equal to the Hubble ra- 
dius. This ratio depends on X/teq where A is the comov- 
ing wavelength, teq = H^g/aEQ defines the comoving 
Hubble radius at the time when the matter density equals 
the radiation density (an epoch denoted by 'EQ') and a is 
the scale factor parameterizing the expansion of the Uni- 
verse. The amplitude of a mode projecting to angular 
scale 8 depends on O/0 E q where #eq = »"eq/ Da because 
@/@eq = A/Veq. Since the amplitude is a strong function 
of this ratio, there is strong sensitivity to #eq and there- 
fore to zeq (1 + z = 1/a) since, assuming the dark en- 
ergy is a cosmological constant, #eq = /(O m )/^/l + z E q 
where 7(f2 ro ) is a very slowly- varying function of f£m |58j. 

Thus the CMB power spectrum is sensitive to 1+zeq = 
Pm/Pr- If w ^ assume the standard radiation content 
(N e g = 3.046, p r = p^+pv), then a constraint on 1 + zeq 



directly constrains p m . However, if we are allowing N e g 
to vary, we should study its effects at fixed 1 + zeq since 
this quantity is well constrained by the data. Prior to the 
CMB damping scale measurements, there were already 
hints of high N c s by combining the 1 + zeq constraint 
from WMAP with late-time observables sensitive to the 
matter density [26 . 

We now consider p\. The angular scales of the acous- 
tic peaks are highly sensitive to the angular size of the 
sound horizon, 6 S — t s /Da- Thus, 6 S is very precisely de- 
termined by the data. Given r s , we could infer Da- In a 
universe with zero mean spatial curvature, Da = J cdt/a 
from the time of last scattering to today, and depends 
only on p m and p^. We do know r s , to some degree, 
from our determination of pb and p m as described above. 
These densities determine the history of the sound speed, 
c. [59j and the expansion rate, allowing us to calculate 
r s since it depends on no other parameters in the six- 
parameter ACDM model. This determination of r s al- 
lows for a constraint on p\. 



B. The Effect of Relativistic and Dark Degrees of 
Freedom 

Changing N c g only slightly alters the above story 
about the origins of the parameter constraints. We ex- 
pect inferences of p;,, 1 + ^eq and 9 S to be nearly un- 
affected. In the top panel of Fig. [I] we thus hold these 
parameters fixed while varying N c g from 2 to 6. We hold 
zeq fixed by increasing the density of cold dark matter 
as we increase N e f{. We keep 9 S fixed by changing pa to 
adjust Da- 

As can be seen in Fig. [T] increasing N D g along the cho- 
sen direction in parameter space, makes very little dif- 
ference at low £ (exactly as intended) and an increasing 
difference at higher £. A similar exercise was performed in 
[27] . who identified the same parameters to be held fixed, 
but then ascribed the relative drop in power toward small 
scales as due to a post-decoupling effect known as the 
"early integrated Sachs- Wolfe effect." This explanation 
was summarized and cited in [5]. We claim, in contrast, 
that these differences at high £ are almost entirely due 
to increased Silk damping, caused by the increased ex- 
pansion rate. To demonstrate that the variation is not 
predominantly due to ISW, in the central panel we have 
normalized the spectra at i = 400 where the ISW effect is 
negligible; we see that the high-£ variation is only slightly 
reduced. 

Temperature anisotropies on scales smaller than the 
photon diffusion length are damped by the diffusion, a 
phenomenon known as Silk damping. Diffusion causes 
the drop in power toward high £ and makes the power 
spectrum sensitive to the angular scale of the diffusion 
length, 9d- To second order in A m f p /A, where A m f p is the 
photon mean free path, the temperature fluctuations are 
suppressed by exp[— (2rd/A) 2 ] where the mean squared 
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diffusion distance at recombination is 



...2 

1 d 



da 



R 2 



6(1 + R 2 ) 



(1) 



where n e is the number density of free electrons, o~t is the 
Thompson cross-section, a* is the scale factor at recombi- 
nation (defined below) and the factor in square brackets 
is due to the directional and polarization dependence of 
Thompson scattering [28j [29] . Although Eq. [I] is only an 
approximation to the diffusion length, it allows an an- 
alytic understanding of the dependence of this diffusion 
length on model parameters [21] . 

If we approximate independent of H, then r d oc 

jj-0.5 This j s as expected for a random walk process: 
the distance increases as the square root of time. In- 
creasing H (which happens when we increase N c tf) leads 
to smaller r<j which would decrease the amount of damp- 
ing. Why do we see, in Fig. [T] the damping increase as 
N e g increases? 

The answer has to do with how r s and Da change to 
keep 9 S fixed despite the increased expansion rate. The 
comoving sound horizon is given by 



,. dt/a 



da 



a 2 H 



(2) 



Since r s cx 1/H, it responds even more rapidly to changes 
in H than is the case for rd- To keep 9 S fixed at the 
observed value, Da must also scale as l/H. Since Da 
decreases by more than would be necessary to keep 9d 
fixed, 9d increases which means the damping is increased. 

To look at it another way, if we knew Da perfectly, 
we could use r s to determine H prior to recombination. 
But we do not know Da, largely because we do not know 
the value of the cosmological constant, or more generally 
the density of the dark energy as a function of the scale 
factor. Instead, we can use the two scales together to 
form a ratio that is sensitive to H, with no dependence 
on Da'- 8 d /6 s =r d /r s <xH - 5 . 

Does this explanation hold together quantitatively? To 
demonstrate that what we are seeing in the power spec- 
trum actually is increased Silk damping (at fixed 9 S ) we 
experiment with also fixing 6d as N D ff increases. The 
bottom panel of Fig. [I] shows how the angular power 
spectrum responds to the same variations in N e g, only 
now taken at constant 8d as well. When we remove the 
9d variation, the impact of the iV e ff variation almost en- 
tirely disappears. We conclude that the variations we 
are seeing in the top panel are indeed due to the impact 
of iVeff on the amount of Silk damping. A very similar 
demonstration was provided by [22j . 

To keep 9d fixed as N e g varies, we varied a parameter 
whose sole impact is on the number density of electrons: 
the primordial fraction of baryonic mass in Helium, Yp. 
Even as early as times when 99% of the photons have yet 
to last scatter, Helium, with its greater binding energy 
than Hydrogen, is almost entirely neutral. Thus n e = 
X e (n p + nji) = X e n]y(l — Yp) where the first equality 
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FIG. 1: Top panel: WMAP and SPT power spectrum mea- 
surements, and theoretical power spectra normalized at £ = 
200. The black (central) curve is for the best-fit ACDM + N cS 
model assuming BBN consistency. The other model curves 
are for N e s varying from 2 to 6 with p b , 8 S , and zeq held 
fixed. Larger N e s corresponds to lower power. Central panel: 
Same as above except normalized at £ — 400 where the ISW 
contribution is negligible. We see most of the variation re- 
mains. Bottom panel: The same as the central panel except 
we vary Yp to keep 6d fixed. The lack of scatter in these spec- 
tra compared to those in the middle panel demonstrates that 
the effect of N e g on small-scale data is largely captured by its 
impact on the damping scale. We can also begin to see more 
subtle effects of the neutrinos, most noticeably a phase shift 
in the acoustic oscillations 1221. 



defines X e and we have kept (and thus pi,) fixed. The 
limit of integration in the above equations for r s and 
is only slightly affected by changing Yp and thus r s is 
largely unaffected. However, the damping length scales 
with Yp as r d oc (1 - F P )~ ' 5 . 

From our analysis one finds that rd/r s oc (1 + 
fv) ' 25 / \A — Yp where /„ = p^/p 7 is proportional to 
iVeff ■ The first factor arises because increasing H at fixed 



zeq means H 2 oc (l+/„). Thus as N c g is varied, we know 
how to change Yp to keep rd/r s (and hence 9d/0 s ) fixed. 
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FIG. 2: Probability distribution of N c s using just WMAP 
data, WMAP + SPT, and WMAP + constraints on 8 d /6 s 
from WMAP + SPT (see Tab. [if. 

However, the above analysis requires a small correc- 
tion for two reasons. First, increased expansion, even if 
we keep n e (a) fixed, decreases a* because we define a*, 
following [30], such that the optical depth to Thomson 
scattering from here to a* is unity. Second, recombina- 
tion is not a process that occurs in chemical equilibrium. 
As emphasized in |31j , increasing the expansion rate leads 
to an increase in n e {a). By numerically studying these 
effects, which partially cancel each other, we find that 
r d /r s oc (1 + /„) m /Vl - Y P with m = 0.28 rather than 
0.25. 

Note that when varying N e g in Fig. [T] we also vary 
Yp as is expected for standard assumptions about BBN, 
as will be explained below. Following BBN consistency 
(as opposed to keeping Yp fixed) increases the damping 
effect by about 30%. 

We should mention that neutrino perturbations do al- 
ter the amplitude of the power spectrum at I > 200 by 
a (nearly) constant factor J33] 132] • The breaking of the 
N e s, p m degeneracy in WMAP data at low N e s is due 
to the impact of neutrino perturbations, and this is the 
effect that allowed for an indirect detection of these per- 
turbations as reported in [33J [M] . 

However, with the inclusion of small-scale data, the 
perturbations have lost their significance. In Fig. [2] we 
demonstrate that the N e g constraint from WMAP + SPT 
is well approximated by combining the WMAP7 data 
with the information on 8 d /9 s from WMAP +SPT. 

Fig. [3] provides another way of seeing the importance 
of 8d/9 s to the N e g constraint. From the color coding 
one can see that lines of constant 9d/0 s run along the 
major axis of the probability contours. Further, one can 
see that the BBN consistency line cuts nearly perpen- 
dicularly across these lines. This feature explains why 
the errors on N e g are about 30% smaller if one assumes 
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FIG. 3: The joint likelihood of N e g and Y P with 68% and 
95% confidence contours. Each scattered point in the figure 
shows one element of the Markov chain with color coding the 
corresponding value of 0d/0 s . The purple dashed line is the 
BBN consistency line. 



BBN consistency rather than fixed Yp. If we abandon 
BBN consistency and allow Yp to vary freely, then N e g 
is allowed to vary along the major axis of the probability 
contours and the constraint on N e g loosens considerably, 
as described in more detail below. 



C. Constraining jV e a with Yp free 

We do not have a complete analytic understanding of 
the closing of the contours on the major axis (as opposed 
to the minor axis) in Fig. [3j We can turn though to the 
lowest panel of Fig. 1 to see that at fixed 8d/0 s there 
is indeed some remaining variation to the power spectra 
as iV e ff varies. At least some of this variation is due to 
the difference in acoustic oscillation phase shift that one 
gets for neutrinos, relative to the same energy density in 
photons, due to their free streaming [2"2"] . 

Another effect important for breaking the degeneracy 
between Yp and N c g is due to high baryon fraction. In 
our above analysis we assumed that zeq is a fixed con- 
stant. However, this assumption will break down when 
N e g decreases to lower values, as it can do if Yp is al- 
lowed to vary freely, while the baryon density remains un- 
changed. As N c g decreases, to keep zeq fixed, uj m would 
decrease, thus driving up the baryon fraction, Wb/w m . 
Effects due to the high baryon fraction make it impossi- 
ble at sufficiently low N e g to find a value of zgq , u/f, and 
Wfc/^m that reproduce the measured power spectra. In 
an attempt to best accommodate the data, the region of 
high likelihood departs from constant zeq, as can be seen 
in the upper panel of Fig. [4] which shows probability con- 
tours in the (l + / v ) - f2(,/f2 ro plane. The lines are lines of 
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FIG. 4: Probability contours (68% and 95%) for the case of 
BBN consistency (purple), free Yp (blue) and free Yp and 
Asisw (red). Upper panel: baryon fraction, fi(,/fi m vs. (1 + 
/„). The solid line is for 100 Q b h 2 = 2.267 and z E q = 3200. 
Vayring zeq to 2843 (3557) we get the right (left) dashed 
line. One can see the constant zeq assumption breaks down 
at high baryon fraction. Lower panel: the N e « — zeq plane. 
Here we can see more directly how the extra freedom degrades 
the inerence of zeq- 

constant zeq and Wj,. As extra freedom is allowed in the 
model, allowing N e g to extend to lower values, we can 
see the expected departure of the probability ridge from 
the line of constant zeq and Ub, mainly due to variation 
of z EQ . 

A high baryon fraction alters the acoustic dynamics 
for modes entering the horizon during matter domina- 
tion, because a large fraction of the matter sourcing the 
gravitational potentials is feeling pressure support. This 



pressure support causes potentials to decay, boosting the 
amplitude of the fluctuations as happens with the "radi- 
ation driving" in the radiation-dominated era. At fixed 
uji,, increasing ujb/u) m boosts the height of the first peak 
relative to the second. The existence of this radiation- 
driving-like effect in the matter-dominated era impacts 
the ability to constrain zeq- One can see in the lower 
panel of Fig.Hlhow the uncertainty in zeq increases when 
iV e ff is allowed to take on smaller values (as happens when 
allowing Yp to vary freely). The effect is even greater 
when allowing further freedom in the model by allowing 
^eisw to vary, a parameter we will introduce below. 



III. CONSTRAINTS ON N eB 

Here we present constraints on N c g from current data. 
We begin with our baseline estimates, which are from 
CMB data alone, assuming the six-parameter ACDM 
model extended by one to allow free N e g, with Yp de- 
termined by a BBN consistency relation to be discussed 
below. 

We also consider departures from our baseline assump- 
tions. We let Yp vary away from the standard BBN 
relationship, let dn s /d\rik vary away from what is ex- 
pected from inflation and allow the early ISW effect to 
be greater or smaller than expected, by introducing an 
artificial scaling parameter, A ig\y- 

Finally, we consider the influence of low-redshift dis- 
tance measurements on inferences of N e g. 



A. Baseline Constraints 

We show our baseline constraints on N e g in the first 
row of estimates of Table [TJ which is essentially the same 
with what Keisler et al. [I] have found for the same 
model. 



B. Sensitivity to BBN Assumptions 

The standard assumption is that the baryon-to-photon 
ratio and N e g are unchanged from BBN through decou- 
pling. However, many processes can change this situa- 
tion, such as energy injection into the plasma after BBN 
which would reduce both ojb and N e g, or a decay of a 
massive dark species into a relativistic dark species af- 
ter BBN, which would increase N e g while leaving ojb un- 
changed. Here we relax the standard assumption and 
distinguish the variables with superscripts DEC (for de- 
coupling) and BBN. 

First we consider the extreme of making no assump- 
tions about BBN and using no light-element abundance 
data to constrain the BBN quantities. In practice this 
simply means allowing Yp to be free. As one would ex- 
pect from our earlier discussion, allowing Yp to be a free 



6 



TABLE I: Sensitivity of inferences of N e g and u)t to BBN assumptions 



Assumptions 


Data 


100 ^ BBN 


100w° EC 


100 Auj b 


N BBN 


"off 


AAf off 




Wd d /9„ 


Baseline 
BBN ^ DEC 
BBN ^ DEC 
BBN ^ DEC 

, BBN , ,DEC 
U) b — Ul b 


CMB 

CMB 
CMB+D/H+Y/ 
CMB+D/H+Y/ 
CMB+D/H PC 


2.29 ± 0.11 
9 16+ 011 


2.27 ± 0.05 
2.26 ± 0.05 

2.26 ± 0.06 

2.27 ± 0.05 
2.27 ± 0.05 


u.iuo_ 124 


3.80 ± 0.26 
3.09 ± 0.21 
3.31 ± 0.58 


3.86 ± 0.62 

3 2 a + 109 

3 76+ 77 
3 - 93 ±0.7 9 5 

3.93 ± 0.74 


843+ 829 

617+0-896 
uu± ' -0.841 


0.148 ± 0.012 
140+ ' 018 

0.147 ± 0.014 
149+0- 015 

0.149 ± 0.014 


1.569 ± 0.015 
1.569 ± 0.015 
1.567 ± 0.015 

1.566 ± 0.015 

1.567 ±0.015 



The PC superscript indicates the Pcttini and Cooke (2012) inference, A indicates Aver et al. (2011) and P indicates Peimbert et al. 
(2007). The difference Au 4 = u° EC - w BBN , and AiV ef£ = N°f c - N BBN . 



parameter greatly relaxes the constraints on N^p c be- 
cause of its impact on 6d/9 s . This is shown in the second 
row of Table D 

If we assume standard BBN, we can use measurements 
of the abundance of Deuterium relative to Hydrogen, 
D/H, and Y P to determine w BBN and . Simha 

and Steigman 35 provided fitting formulae for the de- 
pendence of Yp and D/H on these quantities. Here we 
present revised ones [60 , that incorporate updates in nu- 
clear reaction rate estimates and a neutron lifetime esti- 
mate. They are 

Y P = 0.2381 ± 0.0006 + [r) W + 100(5 - 1)] /625 (3) 



and 

10 5 D/H = 2.60(1 ±0.06) 
where 



(i 



r^- 6(5-1) 



1.6 



(4) 



r] W = 10 10 n fc /n 7 = 273.9cj 6 + 100(5 - 1) and 

S = [l + 7(7V of f- 3.046)/43] 1/2 . (5) 

To calculate Yp as a function of uib and N e g we use the 
default CosmoMC option which is an interpolation over 
tables produced using PArthENoPE vl.00 described in 
Pisanti et al. |36| . only using Eq. [3] for values outside 
the bounds of the tables for robustness but very rarely 
reached. To calculate D/H we use Eq. [4] 

For the measurements of light element abundances we 
mostly follow Nollett and Holder [37]. They assumed, 
from a compilation of D/H measurements (see Pettini 
et al. [35], Fumagalli et al. [3H] and references therein) 



\og(D/H) = -4.556 ± 0.034. 



(6) 



We also consider a very recent, and significantly more 
preicse, D/H measurement by Pettini and Cooke [ID] of 



\og{D/H) = -4.596 ± 0.009. 



(7) 



Note that signifcant uncertainty arises from nuclear re- 
action rate uncertainty. When using D/H measurements 
we include the 6% error in Eq.[4]and add it in quadrature 
with the measurement error. 

For Helium Nollett and Holder [37 considered two dif- 
ferent inferences 



Y P = 0.2573 ± 0.0033 (Aver) 
Yp = 0.2477 ± 0.0029 (Peimbert) 



from Aver et al. [TS] and Peimbert et al. [H] respectively; 
we will do the same. 

Including the light element abundance measurements 
to help constrain Yp reduces the uncertainty in N^p c , 
though it remains comparatively large. Interestingly, the 
lower Yp measurement, which is the only one of the two 
consistent with A rl | f BN = 3, leads to a slightly higher 
inference. This is because lower Yp values need 
higher N e g to get the same 9d/0 s . 

Pettini and Cooke [40: took an inference of the baryon 
density from Keisler et al. 0], calculated assuming N e g = 
3.046, combined it with their D/H measurement, and the 
above D/H fitting formula, and found = 3.0 ± 

0.5. Thus they show that their data are consistent with 
the combination of CMB data and the assumption of 

iVBBN = ^DEC = 3 04 g 

We performed a similar, but different, exercise where 



we set wP EC 



,BBN 



and estimated both N e g values 



(8) 



simultaneously from the CMB and D/H data. There is 
a small correlation between and N e g as inferred from 
the CMB data which leads to increased uib inference when 
N e ff is allowed to vary. The net result is that our exercise 
leads to a higher value, iV e BBN = 3.3 ± 0.6, with N^ G = 
3.93 ±0.74. 

We note that our analysis, in which we allow the BBN 
quantities to be different from the DEC quantities, is for 
a very general scenario and thefore missing features that 
may be important for specific scenrios. For example, in 
Eggers Bjaelde et al. [H] a scenario was considered in 
which Neg increases after BBN due to the decay of a 
fraction of the dark matter into dark radiation. With 
such a specific choice, one can include in the calculation 
the details of how this conversion happens over time, and 
the differences in the dark radiation perturbations from 
the case of thermally produced neutrinos. 



C. Sensitivity to primordial power spectrum 
assumptions 

Another straightforward way to reduce small-scale 
power is to alter the primordial power spectrum. For 
the usual power-law assumption, the exponent n s is suf- 
ficiently well determined by low-£ data that it cannot 
mimic the damping effect of N c f[. However, if we allow 
for n s to have a logarithmic scale dependence so that 
n s (k) = n s (k*) + h\(k/k*)dn s /d\nk for some constant 
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TABLE II: Sensitivity of inferences of N e « to low-redshift distance measurements, dn 3 /d\nk and the arbitrary 
amplitude of the early ISW effect 



Assumptions 


Data 


100wf EC 


n dec 


dn s / d In k 


j4oisw 


n A 




ioe d /e a 


Baseline 


CMB 
CMB+BAO 
CMB+fli, 
CMB+BAO+flo 


2.27 ± 0.05 

2.25 ± 0.05 

2.26 ± 0.04 
2.26 ± 0.04 


3.86 ± 0.62 
3.83 ± 0.60 
3.73 ± 0.44 
3.97 ±0.41 






0.737 ± 0.025 
0.707 ± 0.012 
0.733 ± 0.021 
0.708 ± 0.011 


0.148 ± 0.012 
0.154 ± 0.012 
0.147 ± 0.011 
n i ^fi+ 009 

U ' 1OD -0 .008 


1.569 ± 0.015 
1.569 ± 0.015 
1.566 ± 0.012 
1.572 ± 0.011 


dn s j d In k free 
dn s j d In k free 
A isw free 


CMB 

CMB+BAO+ffo 
CMB 


2.21 ± 0.07 
2.25 ± 0.04 
2.29 ± 0.08 


2 97+ 91 
3.76 ± 0.43 
3.92 ± 0.65 


-0.025 ± 0.020 
-0.015 ± 0.013 


0.979 ± 0.055 


0.704 ± 0.041 
0.706 ± 0.011 
0.741 ± 0.028 


0.138_ 012 
0.153 ± 0.009 
0.149 ± 0.012 


1 547+ " 022 

-L.J1I -0.021 

1.566 ± 0.012 
1.568 ± 0.015 



1.3 
1.2 

1.1 

1.0 f- 

0.9 

0.8 
0.7 
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FIG. 5: Probability contours (68% and 95%) in the Aasw — 
iV e ff plane assuming BBN consistency (green, tighter con- 
tours) and free Yp (red, broader contours). We see in the 
BBN-consistency case that the constraint on N e s persists de- 
spite the freedom in the amplitude of the elSW effect. For 
the case of free Yp the elSW effect is important for constrain- 
ing N e fs; letting the amplitude vary freely degrades the con- 
straints on N e g somewhat, especially at low N e g. 



dn s /d\nk then the resulting power spectra can better 
mimic the effects of N e g. As a result, if we marginalize 
over dn s /d\nk, we increase the uncertainty in N e g, as 
can be seen in Table ITT1 



D. Importance of the early ISW Effect 

The anisotropy field today can be written as an in- 
tegration over perturbation variables on the past light 
cone. Doing so can be useful both computationally [43] 
and for analytic understanding. Writing the Fourier 
and Legendre-transformed radiation transfer function as 
such an integral (ignoring the polarization-dependence of 



Thomson scattering for simplicity) one gets 
rvo 

e,(fc) = / d V g( v )[e (k, v ) + ^(k, v )} Jl [k( m - v )} 

Jo 



'"^W^TidM'Io-ri)] (9) 



drf 



'/o 



+ / drjf{z(r,),A eISW )e T ¥(fc, r?) - $(fr, 77) 
Jo L 

Xjl [Hvo - V)] ■ 

See Dodelson [U] for definitions. The final integral is 
the so-called integrated Sachs- Wolfe (ISW) effect. The 
other terms are only important when the visibility func- 
tion, g(r]) is non-zero, whereas the ISW term gets con- 
tributions along the whole line of sight between here and 
recombination where r starts to get very large. The grav- 
itational potential time derivatives are zero for a Universe 
dominated by cold dark matter. They are significantly 
non-zero at early times as the radiation density is still 
a significant contributor to the expansion rate (the early 
ISW effect), and then again at late times when dark en- 
ergy becomes important (the late ISW effect). 

In Eq.[9]we have introduced the parameter A e isw so we 
can artificially vary the amplitude of the early ISW effect. 
The function f(z(rj),A e isw) = ^eisw when the redshift, 
z > 30 and otherwise equals one. This use of Aiisw is 
very similar to the use of A\ ens to artifically change the 
amplitude of the lensing potential power spectrum alter- 
ing the CMB power spectrum. We might have chosen / 
to go as the square root of A:isw so that ^4 c isw is scal- 
ing the ISW power (just as Ai cns scales up the lensing 
power). However, a significant impact of the ISW term 
in Eq. [9] comes from its correlation with the other terms 
in the equation. Thus there is no way to make the total 
contribution of elSW to the power scale as a single power 
of A isw- 

The greater the amount of radiation around, relative 
to matter, at recombination, the greater the amplitude 
of the early ISW effect. Thus increasing N e s (with oj m 
held fixed) would increase the amplitude of the early ISW 
effect. This relationship between N e g and the early ISW 
effect led Bowen et al. [37] to cite the ISW effect as the 
reason the CMB is sensitive to N e g. 

To quantitatively investigate the impact of the ISW ef- 
fect, we could perform the exercise of turning it off arti- 
ficially. But turning off the ISW effect would so radically 
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FIG. 6: Allowing A c isw to vary stretches out the contours 
in the Yp plane in the direction not constrained by 8d/9 a . If 
we assume BBN consistency though (the dashed line), then 
it has very little impact on the constraint on N e s. Along the 
direction of constant 9d/8 a , two samples, a) and b), in the two 
Markov chains are picked up with almost identical likelihood 
values. The parameters of interest of these two samples are 
listed, and the corresponding power spectra are plotted in 
Fig. [7] with the A c isw modulation turned off and on. 
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FIG. 7: The power spectra of the two samples shown in Fig|6] 
with the A c isw modulation turned off (on) in the left (right) 
panel. In this figure we see explicitly how allowing A c isw to 
be artificially free weakens the lower bound on N e a in the case 
that Yp is free. 



change the first peak (dropping it in power by 28%) that 
we instead chose to investigate by letting A e isw be a free 
parameter. If the ISW effect is playing an important role 
in constraining N e g then if we let it be a free parame- 
ter, those constraints will degrade and we will also see a 
strong correlation between the two parameters. 

We show probability contours in the iVeff-Asisw plane 
in Fig.[5]for the case that Yp follows the BBN-consistency 



relation and the case that Yp is free. For the BBN- 
consistency case, the constraint on N D g degrades by less 
than 10%. But when Yp is allowed to vary freely, the 
9d/9 s constraint can no longer determine N e g because of 
the degeneracy with Yp. In this case, the region of the 
spectrum affected by ISW is more important for deter- 
mining N e g so we see a significant degeneracy between 
A e isw and N eS . 

From Fig. [6] we can see that marginalizing over A;isw 
does indeed loosen up the N e g, Yp contour. One can see 
explicitly in Fig. [7J how letting Aiisw vary allows lower 
N e g and higher Y P than would otherwise be possible. 

We also see from Fig. [6] that marginalizing over A isw 
does little to expand the minor axis of the contours. As 
we have emphasized, the constraint along this axis is due 
to the constraint on 64/ 6 s . Along the major axis the ISW 
effect does play a role in breaking the N c g — Yp degener- 
acy. Other effects important in breaking this degeneracy 
include those coming from neutrino perturbations |22j 
and high baryon fraction as discussed in Section |II C| 



IV. IMPLICATIONS FOR LOW-REDSHIFT 
DISTANCE MEASUREMENTS 

We now consider the implications of CMB measure- 
ments, in the context of variable N e g, for low-redshift 
measurements . 

First let's consider the effect on the expansion rate at 
low redshift for the ACDM + N e g cosmology. Increasing 
N e g, with the resulting increase in u) m oc (1 + /„) (to 
keep zeq fixed) increases the expansion rate through the 
matter-dominated era. Further, in order to adjust Da 
to keep 9 S fixed (given the decrease in r s ), the energy 
density in the cosmological constant, lu\, must increase 
as well. With the energy density in both components 
important at low redshift increasing, H(z) increases for 
all redshifts in the low-redshift (matter-dominated and 
later) regime. 

Note that if we can ignore changes to the limit of in- 
tegration in Eq. [2]then we expect r s (z*) oc (1 + Z^) -0 ' 5 . 
With this scaling, we then expect Da{z*) oc (1 + f u )~ ' 5 
to keep 9 S fixed. The only way to achieve that scaling 
in the A CDM model is to have the dark energy density 
wa scale in exactly the same way as ui m . Hence H(z), 
and therefore Da(z) scale the same way at all redshifts. 
If Zdrag also has no variation with N e g then the net re- 
sult is that the BAO observables D^(z)/r a (zdtae) and 
H(z) * r^Zdrag) have no dependence on N e g. Note that 
Zdrag is the epoch of photon-baryon decoupling defined 
as in Meiksin et al. 

In practice there are many corrections to the above 
analysis, e.g., the correlation between Zdrag and N e g, but 
they are all small and the net result is very little depen- 
dence of the BAO observables on N e g. We can see that in 
Table [nj where adding in just BAO data makes very lit- 
tle difference to the N e g inference. Another consequence 
of uj\ scaling similarly as ui m is that does not scale 
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FIG. 8: Left panel: Contours of constant probability indicating the 68% and 95% confidence regions in the N e a - Dv(z = 0.57) 
plane. The solid line shows the major correlation direction of Dv(z = 0.57) oc (1 + f v )~ ' 58 with the index best-fitted from 
the Markov chain. Middle panel: Probability contours in the iVeff - r s (zdrag) plane; the solid line holds the best-fit correlation 



direction r s (zdr 
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Right panel: the relationship between N e s and the ratio, Dv/r s - The latest BOSS BAO 



data point 20 is plotted with the dashed line showing its la error, which is re-scaled by multiplying by the r s -rescaling factor 
f54.66/150.82. 



with N e fi. 

Fig. [8] quantitatively supports the above discussion. 
We compute the quantity Dy = (D^cz/ 'H(z)) 1 ^ 3 at the 
redshift z = 0.57 where the latest BAO data were ef- 
fectively measured [2D]. We see that D v (z — 0.57) oc 
(1 + U)-°- 58 , r s (z drag ) oc (1 + /,)-°- 55 ; i.e., they both 
scale nearly as (1 + f»)~ ' 5 - And their scalings are 
so similar that the ratio of the two quantities shows 
no noticeable correlation with N c g. In the case of 
WMAP7+K11+BAO+H , we get a(N cS ) = 0.41, com- 
pared to a(N c s) = 0.42 in Kll [4] for our baseline model. 
The only difference comes from the latest BAO data point 
which shows a little tension with the Dy/r s inference 
from the CMB data[BT]. 

What is sensitive to N e g at low redshift are absolute 
distance measures that are not calibrated with r s . The 
inference of Ho, given ACDM (with standard radiation 
content) calibrated with the WMAP7 and SPT data is 
h = 0.710 ± 0.021. This is 1.17-er lower than the Riess 
et al. 2011 value of h = 0.738 ± 0.024 [H]. Increasing 
N e s to the higher value preferred by the CMB data alone 
brings these two inferences into better agreement. One 
can see the impact of the Riess et al. measurement on 
the reduced uncertainty in N e g in the Table entries for 
the ACDM + N e s cosmology; including the Riess et al. 
measurement drops the uncertainty by about 50%. 

Varying N e g thus changes the ACDM, CMB-calibrated 
predictions for H , without much change in the predic- 
tions for the BAO data - consistent with the analysis in 
Eisenstein and White [46] . Increasing N e s thus can elim- 
inate the tension between the BAO data and the Riess et 
al. H measurement as pointed out in Mehta et al. [ID] . 

The case of dn s /d\nk free allows us to see this pref- 
erence of the low-redshift data for the H(z) that comes 
from an increased N e g. With CMB data alone, letting 



the running vary leads to a downward shift in the central 
value for Neg. Adding in the Ho and BAO measurements 
shifts the preferred value for N e g back up towards 4. 

Increasing N e g also has implications for the growth 
of structure. Increasing N e g leads to an increase in uj m 
which in turn decreases u)b/oj m . Decreasing the baryon 
fraction decreases the pressure support felt by matter 
prior to recombination, thereby boosting the growth of 
structure on scales smaller than the sound horizon at re- 
combination; i.e., scales smaller than about 150 Mpc |47j . 
Therefore increasing N e g increases cluster abundances, 
that are sensitive to the power spectrum amplitude on 
r~> 10 Mpc scales. This effect can be seen in joint esti- 
mates of m v (which has the opposite impact on small- 
scale power) and N c g from cluster abundances in Benson 
et al. [48] and also in [49j. 



V. CONCLUSION 

There are several ways that massless neutrinos im- 
pact the anisotropy of the cosmic microwave background. 
Here we have shown that current CMB constraints on 
iV e ff are dominated by the impact of the neutrino energy 
density on the expansion rate. Although 6 S is sensitive 
to this change in expansion rate from N e g, its simultane- 
ous sensitivity to the distance to last scattering greatly 
limits how well N e g can be reconstructed from it alone. 
Measuring the damping tail region has allowed a mea- 
surement of 6d as well. Since the response of Qd to the 
expansion rate is different from the response of 9 S , their 
ratio (which is independent of the distance to last scat- 
tering) is sensitive to the expansion rate. 

The above analysis assumes that Yp follows the BBN 
consistency relation. Since Yp also alters 6d, allowing it 
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to vary freely introduces a near degeneracy between 7V e ff 
and Yp. This near degeneracy is broken by a number of 
physical effects at low iV cff , including the early ISW effect 
and effects of a high baryon fraction. At high N e g the 
acoustic oscillation phase shifts probably play an impor- 
tant role, although we have not quantitatively confirmed 
this hypothesis. 

We defined a phenomenological scaling parameter of 
the early ISW effect, A c isw- We used it to show that 
the ISW effect plays a very small role for the case of 
BBN consistency, although, as just noted above, it does 
contribute to constraints on N e g when Yp is allowed to 
vary freely. We found that, assuming BBN consistency, 
Asisw = 0.979 ± 0.055 - a highly significant (though 
model dependent) detection of the early ISW effect. 

We tested the consistency of inferences of uib, N e g and 
Yp from CMB data, with inferences from light element 
abundance measurements and BBN theory. We see no 
strong evidence for any inconsistency, though the bounds 
are quite loose. For the most discrepant case we find 
n vec_ n bbn = o.84lg;?|. We note that a simultaneous 
inference of N^ S BN and N$p c from the CMB and the 
new Pettini & Cooke D/H measurement results in an 
N®fP N more consistent with 4 than is the case for the 
similar exercise performed in Pettini and Cooke [40] . 

We considered the impact of low-redshift distance- 



redshift relation measurements on determination of N e g. 
We found that BAO data are not very sensitive to N e g 
because Dy and r s scale similarly with N e g. The same 
cancellation does not occur for distance measures that 
calibrate independent of the CMB, such as the Hubble 
constant determination in Riess et al. [18] , 

We will have tighter measurements from the forthcom- 
ing analysis of the entire SPT survey, and we expect 
improvements to come from Planck in early 2013. We 
project that the error on N e g will reduce to ~ 0.33 using 
simulated full-survey SPT data combined with existing 
WMAP and Hq data, and will reduce further to ~ 0.20 
using simulated Planck data and existing Hq data, as- 
suming the Planck data and foreground model of Millca 
et al. [50] , consistent with the forecasts of Bashinsky and 
Seljak [22] and Hannestad et al. |51j . 
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